Exploring Text Virality in Social Networks 
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Abstract 

This paper aims to shed some light on the concept of viral- 
ity - especially in social networks - and to provide new in- 
sights on its structure. We argue that: (a) virality is a phe- 
nomenon strictly connected to the nature of the content being 
spread, rather than to the influencers who spread it (b) viral- 
ity is a phenomenon with many facets, i.e. under this generic 
term several different effects of persuasive communication 
are comprised and they only partially overlap. To give ground 
to our claims, we provide initial experiments in a machine 
learning framework to show how various aspects of virality 
can be independently predicted according to content features. 



Introduction 

Analyzing and recognizing the persuasive impact of com- 
munication is of paramount importance in many the- 
oretical and applied contexts. For example. Persuasive 
NLP focuses on various techniques such as crowdsourcing 
( Guerini, Strapparava, and Stock 2010) and co rpus based 
methods (IGuerini, Strapparava, and Stock 2008 to address 
the various effects which persuasive communication can 



have in different contexts on different audiences. 

In this scenario "virality" and data collected from social 
networks are burning topics for activities such as Viral Mar- 
keting - a set of marketing techniques using social networks 
to achieve various marketing objectives both in monitoring 
and promotion. Buzz monitoring, for example, is the mar- 
keting technique for keeping track of consumer responses 
to services and products. It involves checking and analyz- 
ing online sources such as forums, blogs, etc. in order to 
improve efficiency and reaction times. Moreover, identify- 
ing positive and negative customer experiences (white and 
black buzz), can help to assess product and service demand, 
tackle crisis management, foster reputation online, etc. On 
the other side, as a promotional strategic goal. Viral Market- 
ing campaigns can have to sustain discussion to get visibility 
for their product or to provoke controversial reactions to the 
brand (e.g. to strengthen the membership feeling). 



Generally speaking, virality refers to the tendency of 
a content to spread quickly in a community by word-of- 
mouth. In the spreading process there are several elements 
at play, e.g. the nature of the spreader and of the audience, 
the structure of the network through which the information 
is moving and the nature of the content itself. 

In this paper, we will mainly focus on the last 
point. We will first argue that virality hinges on the 
nature of the viral content itself, and that it can be 
decomposed into several components. Then, we will 
provide some preliminary experiments, in a machine 
learning framework, using the Digg dataset described 
in ( [Paltoglou, Thelwall, and Buckley 2010) l. These experi- 
ments show how the various components of virality can be 
separately predicted using just the wording of the content 
being spread. 



Related Work. Several researchers studied in- 
formation flow, community building and sim 
ilar tasks using Digg 
(ILer man and Ghosh 20101 



as 



a reference dataset 



Jamali and Rangwala 2009 



Khabi n^Hsu, and Caverlee 2009t 

Aaditeshw ar Seth and Cohen 20081 1. However, the great 

majority considers only features related to the network itself 

or simple popularity metrics of stories (e.g. number of digg s, 

number of comments), without analyzing the correlations 

of these aspects with the actual content spreading within the 

network. 

A notable work using Digg datasets that began to inves- 
tigate some of the insights we proposed is (I Jamah 20091 1. 
It incorporates features, derived from sentiment analysis of 
comments, to predict the popularity of stories. Finally, the 



work presented in ( Berger and Milkman 2009 is probably 



Copyright © 2013, Association for the Advancement of Artificial 
Intelligence (www.aaai.org). All rights reserved. 



the closest to the approach proposed in this paper It uses 
New York Times articles to examine the relationship between 
the emotion evoked by content and virality, using semi- 
automated sentiment analysis to quantify the affectivity and 
emotionality of each article. Still, their virality metric only 
consists of how many people emailed the article, which is an 
interesting but very hmited metric. 



Virality characteristics 

Virality is a social phenomenon in which there are no "im- 
mune carriers'^. That is to say, a content is either viral or 
not. If a message is viral, it will immediately spread, by no 
chance it will remain latent, waiting to activate. 

This means that although the study of epidemiology and 
information spreading in social networks is important, it 
does only partially account for the phenomenon of content 
virality. Individuating the central nodes (i.e. the influencers) 
of a social network is certainly necessary if we want our con- 
tent to spread quickly to a significant audience, but it will 
not grant its spreading to further nodes. The book "The tip- 
ping point" dGladwell 20021 ) maintains that influencers have 
a central role in the spreading of a fashion or a product - 
even to further nodes - and this seems to contradict our idea. 
However, GladwelFs standpoint has also been criticized by 
some researchers (IWatts and D odds 2007 1. 

Thus, we can reformulate our standpoint in a less contro- 
versial way: Social network structure analysis accounts for 
how content spreads, rather than why. Accordingly, our first 
aim is to investigate the characteristics of the content which 
make it viral. 

In this work we consider "virality" of text-based contents, 
focusing on the various effects created within an online com- 
munity. Still, our analysis method can also be applied to 
other kinds of contents such as images and video-clips. 

Traditional approaches based on "popularity metrics" 
(e.g. the number of IJike) are not sufficient for modeling 
a complex phenomenon such as "virality" of a text. First of 
all, let us try to distinguish different phenomena: 

• Virality: in its basic form it refers to the number of people 
who accessed a given content in a given time interval. This 
is the most generic definition of virality. 

• Appreciation: how much people like a given content, for 
example by clicking an IJike button. 

• Spreading: how much people tend to share this content by 
forwarding it to other people. 

• Simple buzz: how much people tend to comment a given 
content. 

• White buzz: how much people tend to comment in a posi- 
tive mood (e.g. "The best product I have ever bought"). 

• Black buzz: how much people tend to comment in a nega- 
tive mood (e.g. "Do not buy this product, it is a rip-off"). 

• Raising discussion: the ability to induce discussion 
among users. 

• Controversiality: the ability to split the audience in differ- 
ent parties (usually pro and against the given content). 

While many of the above definitions are straightforward, 
some of them deserve a little more discussion (i.e. black 
buzz, raising discussion, controversiality). 

A paradigmatic example of black buzz is the "Bonsai Kit- 
ten" hoax. In 2000 a website suggested that kittens were be- 
ing subjected to horrific mistreatment to shape their bodies. 



as if trees were being shaped into bonsai. The "bonsai kitten" 
hoax was widely believed and provoked a storm of outrage 
and negative word of mouth reactions throughout the world, 
up to a point that FBI was compelled to investigate the Web 
site. Similarly, but less outrageously, the recent iPhone 4 an- 
tenna problem provoked a lot of talks on reception issues. 
For a while Apple ignored the issue, but the black buzz 
grew so quickly that Apple CEO, Steve Jobs, had to hold 
a press conference to address the problem promising a free 
of charge solution. These examples show how a content can 
spread exactly because it is disliked. 

Some well know examples of raising discussion refer to 
movie promotion, namely "The Blair witch project" and 
"Cloverfield". Both movies managed to became a cultural 
phenomenon long before their release. Just consider that the 
discussion about the movie "The Blair witch project" rose 
a $30,000 film into a $150 miflion blockbuster. All these 
discussions were triggered by two very simple but powerful 
human characteristics: curiosity and mystery. People were 
discussing about its reality and trying to guess whether all 
the things shown in the movie had really happened. 

As another example: "Medal Of Honor" is a video game 
set during the current Afghanistan war. The controversy 
raised about the option, for players, to play also as TaUban, 
splitting the audience in pro (claiming that it was only a 
game) and against (claiming that this was harmful and disre- 
specting for fallen Allied soldiers). The difference of opin- 
ions about the product was so strong that it echoed world- 
wide. More generally, there are topics which are intrinsically 
controversial such as death penalty and abortion. 

Dataset, Metrics and Experiments 

In this section, we formalize some of the aforemen- 
tioned phenomena using the Digg dataset described in 



(Paltoglou, Thelwall, and Buckley 2010). This corpus al- 
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lows us to define and analyze many of the above definitions 
in a unique framework. These formalizations (metrics) rep- 
resent an initial attempt to model phenomena which have not 
been addressed before, and they can be refined as a future 
work. Starting from these metrics we then build specialized 
datasets for our experiments. 

Digg Dataset. Digg is a social bookmarking website where 
people can share, vote and comment "stories". Stories 
typically contain a title, a short textual description 
(snippet) and a link to the original web content. In this 
dataset the average length of the title is 7 words, the av- 
erage length of snippet is 31 words. Users are allowed 
to publicly approve submitted stories with a process called 
"digging", and publicly approve or disapprove comments on 
submitted stories (called "DiggUp" and "DiggDown"). The 
data gathered from the site is a complete crawl spanning 
the months February, March and April 2009. Some statis- 
tics concerning the gathered data include: 1,195,808 sub- 
mitted stories; 135,367 stories with at least one comments; 
1,646,153 individual comments; 877,841 users who submit- 
ted a story/comment; 14,376,742 approvals (i.e. diggs). 

Appreciation. Appreciation is connected to the number of 



diggs a story receivecQ. The formula for appreciation is: 

A = ND (A) 

where ND is the number of diggs for a given story. 
Dataset: we considered as appreciated stories those receiv- 
ing more than 100 diggs (i.e. A > 100). In the dataset 
there are 16660 stories with this characteristic. Then, we 
randomly extracted an equal number of unappreciated sto- 
ries (i.e. or 1 digg) to obtain a dataset of 33320 instances. 

Buzz and Spreading. Within the Digg community, the same 
story can be submitted only once. Therefore it is difficult to 
model the concept of spreading. In our view, the number 
of different users commenting on a story is a good hybrid 
measure to model buzz and spreading. The formula for Buzz 
and Spreading is: 

BS = NUC (BS) 

where NUC is the number of such different users. 
Dataset: we considered as buzzed only stories which have a 
BS > 100. There are 3270 examples of buzzed stories, and 
we randomly extracted an equal number of non buzzed sto- 
ries (i.e. comments) to obtain a dataset of 6540 instances. 

Raising-discussion. The number of comments alone do not 
state whether there has been a discussion among users about 
a story; we need to distinguish low level comments (i.e. ex- 
plicit replies to other comments) from top level ones. The 
formula for raising discussion is: 



RD = (NCl/NCt) * NUC 



(RD) 



where NCt is the number of comments for a given story 
and NCl is the number of (low-level) comments, i.e. com- 
ments which are replies to other comments. 
Dataset: we considered as positive examples only those sto- 
ries which have a RD score > 50. Accordingly, we collected 
3786 examples of stories which raised discussion, and then 
we randomly extracted an equal number of negative exam- 
ples to obtain a dataset of 7572 instances. 

Controversiality. The problem of opinion controversial- 
ity has received little attention, a notable exception being 
( [Carenini and Cheung 2008) . While their model comprises 
multiple classes (i.e. votes in a -3 to +3 scale), our model 
is binary: only "I like" (+1) or "I don't like" (-1) votes are 
allowed. Since stories cannot receive negative votes in Digg, 
we decided to consider the "DiggsUp" and "DiggsDown" of 
the associated comments. If the highest number of positive 
votes among comments is A and the highest number of neg- 
ative votes is B, the formula for controversiality is: 



C = min{A, B)/max{A, B) 



(C) 



min{A, B) denotes the smaller value of A and B, and 
max{A, B) denotes the larger of the two. This measure 
ranges from a minimum of (total agreement, everybody 
who voted either liked or disliked the comments) to a maxi- 
mum of 1 (highest controversiality, the sample votes splitted 
exactly into two). 



Dataset: we considered as controversial stories only those 
which have a controversial score (|C} > 0.9. We collected 
3315 examples of controversial stories, and we randomly 
extracted an equal number of non controversial stories to ob- 
tain a dataset of 6630 instances. 

Datasets preprocessing. We focused only on the text 
present in the stories. As features we solely considered the 
words contained in the title and in the snippet of the 
story itself. To reduce data sparseness, we PoS-tagged all the 
collected words (using TreeTagger (ISchmid 1994) and con- 
sidered the lemmas only for content words, i.e. nouns, verbs, 
adjectives and adverbs). In the experiments, we did not make 
any frequency cutoff or feature selection. 

Experiments. We conducted a series of preliminary ex- 
periments, in a machine learning framework, to explore the 
feasibility of independently predicting the impact of stories 
according to the various metrics proposed. For all the exper- 
iments we exploited Support Vector Machines framework, 
in particular we used SVM-light under its default settings 
(Joachims 1998). For every metric we conducted a binary 
classification experiment with ten-fold cross validation on 
the corresponding dataset. Note that all datasets are bal- 
anced, i.e. 50% of clearly positive examples and 50% of 
clearly negative examples - according to the metric under 
investigation: this accounts for a random baseline of 0.5. 





Fl measure 


Appreciation 


0.78 


Buzz 


0.81 


Controversiality 


0.70 


Raising-Discussions 


0.68 



Table 1 : Results for the classification tasks 



Discussion 

As far as the performance is concerned, we can state that 
the simpler metrics (namely buzz and appreciation) have 
much higher Fl (see Table [T]!. This can be explained by the 
fact that they represent simpler and more direct indicators 
of audience reaction to a given content. Still, also complex 
metrics (i.e. raising-discussion and controversiality) yield a 
good Fl, which suggests that these phenomena can be auto- 
matically predicted as well, and that they are highly corre- 
lated to (and provoked by) the content being spread. 

Next step is to understand to which extent the various 
viral phenomena are independent. If we consider Table |2] 
(where the percentage of the stories of a class overlapping 
with another is given) we can see that buzz and apprecia- 
tion are the datasets with the greatest intersection (77% of 
buzzed stories are also "appreciated"). The correlation (and 
predictabihty) between the two phenomena has been in the 
focus of some research that confirms their interdependencies 
( Jamali and Rang wala 20091. Going into details of Table |2] 
we see that: 



A digg represents an acknowledgement of a story, regardless 
of being positive or negative (e.g. in the Bonsai Kitten example). 



• Appreciated stories overlap quite similarly with buzz and 
raising-discussion stories (15.1% and 14.8%) while the 
controversial ones are the most independent (only 4.2%). 



• For buzzed stories we see a high relationship with the ap- 
preciated ones (as stated), and a significant overlapping 
with raising-discussion ones (51,7%), but also in this case 
a very low overlapping with controversial (only 3.1%). 

• For controversial stories we already saw a low relation- 
ship with appreciation and buzz (21.4% and 3.0% of 
the controversial stories) but a higher overlapping with 
raising-discussion ones (48.6%). 

• Raising-discussion stories, instead, seem to overlap quite 
equally with the three other classes. 





App 


Buzz 


Cont 


Rais 


Appreciation 


- 


15,1% 


4,2% 


14,8% 


Buzz 


77,0% 


- 


3,1% 


51,7% 


Controversial 


21,4% 


3,0% 


- 


48,6% 


Raising 


65,0% 


44,6% 


42,5% 


- 



Table 2; Percentage of stories overlapping with another class 

White and Black Buzz. We tried to investigate the issues 
regarding white and black buzz. The Digg corpus contains, 
for each comment, a BinaryEmotion field, which yields 
the results of an automatic classifications using a Lexicon- 



based approach; see (Paltoglouet al. 2010 1 for further de- 
tails. The BinaryEmotion field indicates whether the 
comment is considered as positive, negative or neutral. The 
formulae for White Buzz and for Black Buzz, using the 
BinaryEmotion field on the comment, are: 

WB = {Positive > (Neutral + Negative)) (WB) 

BB = {Negative > {Neutral + Positive)) (BB) 

where Negative, Neutral and Positive indicate the number 
of comments for a given story, categorized as negative, neu- 
tral or positive, accordingly. We calculated the WB and BB 
for the stories with NUC > 100. According to the formu- 
lae proposed previously, we found 254 stories raising white 
buzz, and 1499 stories raising black buzz. These examples 
are too few for a machine learning approach and, further- 
more, since the BinaryEmotion field is automatically 
calculated, results may not be very reliable. Nonetheless, 
it seems that a negative mood (black buzz) is predominant 
in the dataset. This predominance of black buzz can be ac- 
counted for by considering that the DIGG structure does not 
allow the expression of "diggsDown" for stories. So people 
could be more inclined to explicitly express (with comment) 
their dislike. Anyway this issue deserves further exploration. 

Conclusions 

In this paper, we argued that content virality, in social net- 
works, hinges on the nature of the viral content itself, rather 
than on the simple structure of the social network. We fur- 
ther argued that virality is a complex phenomenon which can 
be decomposed into several components. We defined and de- 
scribed many of these viral components, namely apprecia- 
tion, spreading, buzz, white and black buzz, raising discus- 
sion and controversiality. We then provided experiments, in 



a machine learning framework, which showed how the vari- 
ous viral phenomena can be separately predicted using just 
the wording of the content being spread, and analysed the 
interdependencies amongst them. 

As a future study we will deeply explore black and white 
buzz, possibly experimenting on other annotated datasets, 
and incorporating as features the emotions evoked by the 
stories (e.g. "fear", "joy", "anger"). Moreover, we are going 
to extract "viral" lexicons to be used in applicative scenarios, 
such as generating messages for marketing campaigns. 
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